Zipf and non-Zipf Laws for Homogeneous Markov Chain

نویسندگان

  • Vladimir V. Bochkarev
  • Eduard Yu. Lerner
چکیده

Let us consider a homogeneous Markov chain with discrete time and with a finite set of states are nonrecurrent. The goal of this work is to study frequencies of trajectories in this chain, i.e., " words " composed of symbols E 1 ,. .. , E n ending with the " space " E 0. Let us order words according to their probabilities ; denote by p(t) the probability of the tth word in this list. In this paper we prove that in a typical case the asymptotics of the function p(t) has a power character, and define its exponent from the matrix of transition probabilities. If this matrix is block-diagonal, then with some specific values of transition probabilities the power asymptotics gets (logarithmic) addends. But if this matrix is rather sparse, then probabilities quickly decrease; namely, the rate of asymptotics is greater than that of the power one, but not greater than that of the exponential one. We also establish necessary and sufficient conditions for the exponential order of decrease and obtain a formula for determining the exponent from the transition probability matrix and the initial distribution vector. Index Terms —– Time-homogeneous Markov chain with a finite state space, power laws, analytic information theory, monkeys typing randomly, exponential laws, rank-frequency distribution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Strong, Weak and False Inverse Power Laws

Pareto, Zipf and numerous subsequent investigators of inverse power distributions have often represented their findings as though their data conformed to a power law form for all ranges of the variable of interest. I refer to this ideal case as a strong inverse power law (SIPL). However, many of the examples used by Pareto and Zipf, as well as others who have followed them, have been truncated ...

متن کامل

Zipfian and Lotkaian

This paper studies concentration (i.e. inequality) aspects of the functions of Zipf and of Lotka. Since both functions are power laws (i.e. they are – mathematically the same) it suffices to develop one concentration theory for power laws and apply it twice for the different interpretations of the laws of Zipf and Lotka. After a brief repetition of the functional relationships between Zipf’s la...

متن کامل

Zipf and Heaps Laws' Coefficients Depend on Language

We observed that the coefficients of two important empirical statistical laws of language – Zipf law and Heaps law – are different for different languages, as we illustrate on English and Russian examples. This may have both theoretical and practical implications. On the one hand, the reasons for this may shed light on the nature of language. On the other hand, these two laws are important in, ...

متن کامل

Co-occurrence of the Benford-like and Zipf Laws Arising from the Texts Representing Human and Artificial Languages

We demonstrate that large texts, representing human (English, Russian, Ukrainian) and artificial (C++, Java) languages, display quantitative patterns characterized by the Benford-like and Zipf laws. The frequency of a word following the Zipf law is inversely proportional to its rank, whereas the total numbers of a certain word appearing in the text generate the uneven Benford-like distribution ...

متن کامل

Zipf Law and the Firm Size Distribution: a critical discussion of popular estimators

The upper tail of the firm size distribution is often assumed to follow a Power Law behavior. Recently, using different estimators and on different data sets, several papers conclude that this distribution follows the Zipf Law, that is that the fraction of firms whose size is above a given value is inversely proportional to the value itself. We compare the different methods through which this c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1207.1872  شماره 

صفحات  -

تاریخ انتشار 2012